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Abstract 


In this thesis we desciibe the design of a low bit rate video codec based on 
an aibitraiy shaped region based approach Unlike conventional region based ineth 
ods the region shape information is not transmitted in the present approach, as it 
can be synchronously obtained by segmenting the reconstructed picture at both the 
encoder and decoder lespectively A local decoded picture is divided into several 
segmented regions and moving regions are selected based on the frame diffeience 
The algorithm used for spatial segmentation is a multiscale gradient algorithm fol 
lowed by the watershed tiansformation which provides accurate segmentation at 
very low computational cost A novel corner detection and tracking approach is 
used for robust motion estimation Coiner points aie used to represent each moving 
legion and estimation of then motion through tracking is used to chaiacterize the 
motion of each moving region A Least squares method is used to estimate the 
motion parameters The estimated motion is used to piedict the next frame using 
motion compensated prediction Finally an efficient method for coding prediction 
error is also proposed The algorithm developed was tested on standard sequences 
A data rate between 10 20 kbps was obtained at a frame rate of 7 5 frames/sec The 
PSNR obtained ranges from 38db to 34db for different sequences indicating good 
quality of reconstructed images 
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Chapter 1 


INTRODUCTION 


1 1 The need for video compression 

The digital representation of an iinige, oi i secjiu lui of nn ig( , icciuiits a very laig( 
iiiiinbd of bits llu goil ol image (oding is to icdiiu tlu mimbti of bits as iimdi 
as possible , and to le (oust me t i fiilhfnl duplie lie of the oiigiiiil pie tine 

The efficient digital representation of image and video signals has been the 
subject of corisidei able rcseaich o\er the past 20 years Digital video coding tech 
nology h is devcleipcd into i in ituit held and pioducts have been developed that are 
txigeted for a wide lange of emeiging applications, such as video on demand, digital 
1 V /HD rV bro id casting md multimedia imagc/video data base services With the 

increased comineicial mteicst in video communications the need for international 

( 

image and video compression standards aiose 

To meet this need, the moving Picture Experts Gioup(MPEG) was formed 
to develop eocliiig stand uds In leeent tunes MPEG i and MPLG-2 video eod 
mg standards have atti acted woild wide attention, due to an increasing number of 
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\ay laij^e scale intcgiation (VLSI) and software iinpk mcntitions oflhese standards 
becoming corniiKKi illy ivuhblc Ml’LG 4, (lie niosl i(((nl MPL& stinclnd foi 
tiinsnntting video signals it vciy low bit rites is motivated by its poteiitnl ap 
pile itioiis for \ idco phones video confeieiicing multimedia eleetioiiic mail remote 
sensing clcetionie newspapers mtei active miiltmiedia ditabases multimedia an 
iiotitioii suiveilliuee telemedicine eommuiiic ition uds foi deaf people and many 
otheis 


Ihe nnm huidle m tlie implement it ion ol these ipplications arises from the 
difheulty in compiessmg huge amount of visual information to meet the available 
b indwidth range In \idco phone or video confeieiicmg application the baekgiound 
of the scene lein nns uneh iiiged m eonsecutive li lines and only few moving regions 
are piesent So there is lot of ledundant information which is icrnovrblc As a 
result the video sociueiicc cm be icpiesented by r low bit i ite streiiu winch cau be 
tiansinit tod over (e leiiliono ne twoik whose binelwuKh is of (lie oieiei 64Kb/soc We 
discuss some of the eomnion video si uieluds m the lollownig see turn 


1 2 Existing Video Standards 

1 IIU r H261 Video Coder 

The irU II 261 Video Coding Standard came into existance rn early 80 s 
Ihis video coder piimanly aims at achieving bit i rtes of p x 64Kbps, where p vanes 
from 1 to 30 Motron compensation is employed to predict the images DOT and 
VLC arc also used llie mam applie it ion die Video telephony and Video Confer 
rencmg 

2 MPEG 1 
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MPEG 1 was tlie next m line aftei H 261 Many storage media and telecom 
inunic iLion dianiicls like LANS etc are well suited foi a bit rate of 1 5 Mbps MPEG 
1 satisfies these lequiienicnts In addition it is ilso the first standard to jointly im 
pleinent both Nidco and audio coders It cncouiagcs a lot of interactive applications 
I( doesn’t specify my st mchid encoding piocess Hence the compression algoiithrn 
is left to the desigiiei But the image quality should be coinpaiable to that of a 
VCR and audio qu ihty to that of a CD 

3 MPEG 2 

MPEG 2 IS in extension of the MPEG 1 Inteinalioiial staridaid for digital 
eompiession of \ideo and audio signals During 1990, MPEG recognised the need 
for 1 second, ichted st iiidaul foi coding video for bioadeost formates at higher 
data rates MPEG 2 stand ud is cap ihle of coding stand ird definition television at 
bitrates from about 3 15Mbps and high definition television at 15 30Mbps It sup 
poitb a lot of inld i( (ivc aiiiihc at ions like Digital stoi age me ch i, compute i giaphits, 
multimedia etc Ihis generic coding standard is designed to provide a wide spectium 
of bit latcs, 1C solutions, (jii ihly 1( vc Is uulscivitts ilic ulv intagc of (Ins sytciii is 
it’s compatabihty and scalability with other systems The four compatabilities are 
defined as 

• A system is called as backward compatible if an existing decoder can decode 
a signal encoded by the new encoder 

• A system is foi ward compatible if a new decoder can decode a signal encoded 
by the present encoder 

• The system is upward compatible if a higher resolution decoder is able to 
decode the signal encoded by the present system 
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• A sysifiii IS do\\n\^ \tcl compitilde if t. 1ow( i usoliiLion decoder is iblc to 
decode the bit stieain or part of the bit stream produced by a higher lesolution 
cncodei 

MPre 1 /n 2()1 U( foiwiid iiid iipwiid ( 01111 ) liihh witli MPFO 2 The 
standard IdPEG 2 is brekw ud iiid dowiiwud compatible with MPEG 1 /II 261 
As for IS uulio IS coiicdiud MPEG 2 is only loiwvid uid bvckwiid compatible 
Scalability means tint a pait of a bit stream can be decoded This allows decoders 
with less pioccssmg powers to display video at lowci icsolution/quality The video 
standiid MPEG 2 is s( il ibh 

4 MPEG 1 

MPEG 4 was staited with a view of achieving bit lates lower than G4 kbps 
to enable the tiansmission of video and audio through Public Switch Telephone 
NcLwoiks Ihis is how MPEG 4 (\olvcd b(giiuiing hum II 2G1 Hr hiiil diilt is 
just i( leased 


1 3 Very Low Bit Rate Video Coding 


A New interest has been recently arisen among the image coding research communrty 
m the field of Very Low Brt R ate Image sequence coding 1 he motivation of such an 
interest lies m the developcment of new applications such as video phones, mtci active 
multimedia databases, remote sensing and many others The key requirement foi 
these applications is the low capacity for transmission or stoiagc, in order, to use 
existing commnnualion networks or foi mobile eommunieation 

For many yeais waae foiin based image coding approaches have been the 
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only appioaclies utilized in image coinpiessioii But they presented severe limitations 
foi vei} low bit rate low bit r rte video coding applications In order to overcome the 
limit itions imposed by first geneiatioii image coding techniques, second generation 
image coding was formally introduced in 1985 This second generation approach 
consist of taking into iccount the clnricteiistic of the human visual system to 
define the coding scheme As a result of iiieludiiig the huirian visual system, second 
gonei ition can be also seen as an appioadi of seeing the image coinposcd by dilfeient 
tntilies e riled objects Ihis implies that the image or sequence of images have fiist 
to be analysed and/or segmented in older to find the entities 

Methods used m second gcnei rtioii coding can be mainly dcvided into two 
ertegoi les 

• Segmentation based schemes 

• Model b rs( d sehomes 

13 1 Segnieiiiaion-based schemes 

Among the difleidiL coding appioaclies grouped under the name of second geneia 
tion coding techniques, there is an mciease of interest in segmentation based image 
coding appi caches Tliese coding methods divide the images into a set of connected 
legions so that every pixel in the image is related to one, and only one, region The 
set of regions form a partition of the image Each region m the partition recieves 
a diffeient label Images are desciibed in terms of the partition is well as the m 
formation related to the interior of each region (e g texture, motion, etc) This 
information is neccessaiy to reconstruct the image in the reciever The reason foi 
the curient interest in segmentation based coding approaches is mainly twofold 


Fust they aie the basis toi the new efhcient coding methods Being a second 
generation technique segmentation based coding approaches tiy to eliminate the 
redundant infoimition within and between fi lines taking into account the special 
piopcities of tin hum in visu i! system Iiipuliculii the segmentation proccduie 
yield a putition wliosi regions aio lioinogc iiious in some sense Due to this ho 
mogemety, the iiifoiniation of each legioii can be sep ii itely coded in a veiy elFicient 
inaunei 

Secondly, the} open tin door to new luiietioii ihtu s ni the coding selicine In the 
fiainework of video coding, new coding schemes allowing functionalities such as 
content based multimedia d ita icccss etc Such function ihties demand a dcscrip 
tion of the iniige sequences m terms of objects winch can be grouped into aieas of 
mteicst bj (he user 


13 2 Model-based schemes 

In model based image coding the input image is viewed as a 2 D piojection of a 
3 D ical woild (scene) Iho coding is peifoiined by Inst modeling the 3-D scene, 
extract mg the model parameters at the encoder and finally synthesizing the image at 
the decoder by using the extracted and quantized parameters If we can reconstruct 
the three dimensional scene model that leads to 2 D image sequence, and the images 
are analyzed and synthesized based on this model, then a great reduction in image 
information can be expected This is the idea of Model — based coding method 
However they can be used for a very limited range of scenes 

Though these coders offer the promise of faiily good quality images at low 
bit rates, the complete system is still under study and the coders designed require 
considerable computational resources[19] Need for a good model is very important 
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since the quilii> of the iini^e leconstiuctiou depends to a lart,e extent on the 
model rinthei moie the lepoited systems are limited to a particular model such 
as the head and shoulder image and a statioiiaij- back giound A system using a. 
model of Claire einnot provide good results foi the Miss America sequence 


1 4 Objective of the thesis 


In this thesis oiii ob)ec five w is to build a Low bit lato PC BASED VIDEO CODEC 
foi bit lates in the i uig( of 20kbps oi less Inspired by dilfcicnt techniques used 
to achieve low bir rates a new algoiithm has been developed and described m this 
thesis Using the nev method leconstructed images of good quality was achieved 
foi a bit rate of 10 20kbps 

B isie illy the method developed is a segmentation-based coding system in 
volvmg tliice stages 

1 In the ill St stage Segmention is earned out that splits the original data into var 
lous homogeneous eoniponcnts such that eaeli eomponent coi responds as much as 
possible to semantic units Morphological based Watershed Segmentation Algorithm 
has been used for this purpose Mathematical morphology is indeed atti active for 
this puipose because it is a geometrical approach to signal processing and easily deals 
with ciiteria such as shape, size, eontrast, connectivity, etc Moieovcr, Mathemat 
ical morphological transformations can be efficiently implemented in both softwaie 
and liaidwaie llus is of prime importance, because the major bottleneck m the 
segmenation based coding schemes is the complexity, accuracy and computational 
load of the segmentation stage The Watershed Algorithm however can potentially 
provide anaccurate segmentation at low computational cost[3] 

2 The second stage is the Motion estimation Although the segmentation based 
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video coding techniques rely on the concept of arbitrarily shaped legions the mo 
tion estimation methods used in H 261 MPEG 1 etc rely mainly on block matching 
techniques The problems associated with conventional block based approaches are 
a) Block distortion due to block based processing in prediction and in Transform 
coding frequently appears at low bit rates because the prediction errors cannot be 
encoded sufficiently b) Some times appearance of frozen blocks can cause serious 
distortion 

In the present algorithm Low level features are used for motion estimation The low 
level features such as the corner points of the image are extracted and their position 
is tracked between frames Feature based approach reduces the vast amount of data 
present in an image without necessarily eliminating salient information Low level 
descriptors are preferred foi important reasons like generality and graceful degrada 
tion 

3 In the third stage coding is carried out This consists of coding the prediction er 
ror and motion information The standard JPEG coding scheme for prediction error 
coding IS not suitable for very low bit rate apphcations(10 20Kbps) because it takes 
around bOkbits to represent the error information itself In this thesis we propose a 
new coding scheme employing standard DOT and quantization for prediction error 
coding 


1 5 Organization Of The Thesis 


This thesis is organized as follows 


1 Chapter 2 This chapter gives an overview of the adopted approach to design 
the low bit rate video codec 
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2 Chapter 3 This chapter describes the Multiscale gradient based watershed 
segmentation algorithm for accuiatc segmentation The use of Multiscale gra 
client algorithm in enhancing the blurred edges over noise is discussed An 
efficient algorithm for the removal of small local minima is explained This is 
followed by a discussion on implementation of Immersion algorithm to calcu 
late the fast watersheds 

3 Chapter 4 This chapter discusses a new motion estimation algorithm based 
on the control point tracking approach The selection of moving regions from 
the segmented regions of the previous frame and 3 D motion estimation of 
each moving region inteims of eight motion parameters is discussed 

4 Chapter 5 This chaptei discusses an improved version of standard JPEG 
method to code the error image 

5 Chapter 6 Results obtained at each stage of the coder are discussed and 
PSNR of diffeicnt image sequences are compared for different bit rates 

6 Chapter 7 This chapter concludes the thesis and discusses the scope for 
future woik 
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Chapter 2 


APPROACH ADOPTED IN THE 
DESIGN OF LOW BIT RATE 
VIDEO CODEC 


llie approach adopted, in the design of the low bit rate video (< 20kbps) encoder 
and dicodci, utilizes i hybiid coding scheme ol motion compensated piediction 
and traiisfoim coding The diffeience between conventional H 261, H 263, MPEGl, 
MPEG2 stuid ud coding irulhods uid the pioposcd algoiitliin is the use of icgion 
based methods to obtain aibitraiy shaped legions of an image for coding instead of 
square block stiuetuie norinally used 

The encoding pioccss is illustrated in Piguie 2 1 A local decoded picture is 
devided into seveial segmented regions in the segmentation block Moving regions 
aie selected from the segmented regions based on a frame difference calculated in 
the changed pixel detector, between a current input frame and a previous decoded 
frame The 3 D motion of each moving legion is estimated inteims of eight motion 
parameters Then a predicted pictuie is constiucted by motion-compensation pre 
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diction for c\ery inoMiig region Finally the significant prediction eiror regions are 
encoded by a modified tiansforin coding algoiithm The pioposed method to encode 
the eiioi pictuic is found to be suitable foi low bit late ipphcations In addition 
the estimated motion paiameteis aie encoded using uniloirn quantization and fixed 
length coding 

The decoding process is shown in Figure 2 2 Decoder lecicves the serial bits 
and decodes it to iccover the eiioi and motion inloimilion Ilie previous decoded 
pictuie IS segmented using method used m the encoder The predicted pietuie 
IS constiucted by motion compensated piedietioii bised on the decoded motion 
paiameter iiifoini itioii Finally, the piedietion ciior is added to the predicted picture 
to get the reconstiueted oiiginal fi uiie 

This appioveh is inheicntly much nioic snnplei than othei approaches de 
scribed in the literature The iriain advantage is that it is not necessary to send the 
boundary mfoimation of the mam regions which m leality takes the largest fraction 
of the band width in other appro lehes Using the method described a bit rate as 
low as 9 2Kbps has been achieved for a SNR of 37db 


I 
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Figuie 2 2 Block diagram of Decoder 
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Chapter 3 


SEGMENTATION 


IliG goal of im ige segmentation is to partition an image into hoinogeiicous icgions 
and locate the contouis of the regions as accurately as possible Hence segmentation 
IS as essential step in anj Region based method 

The segmentation methods in general can be classified into three gioups 
thresholding, edge based and region based segmentation 

1 Thresholding lliiesholdmg lepiesents the simplest image segmentation 
method, and is computationally inexpensive and fast Global thiesliolding i e 
using a single threshold for the entne image is successful omly under special 
eiicumsi iiKCS wlun gi ly level viii ilions m m image is small In other eases, 
segmentation methods using vaiiable thresholds, that is the threshold value is 
varied over the image as a function of the locil image chaiactciisties, can be 
a better choice some extent Diffeient threshold detection methods exist to 
determine icspective thresholds automatically Threshold based segmentation 
techniques v ill fail if the assumptions made in the detection of thresholds are 
not tiue 
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2 Edge based Edge based segmentation relies on edges found in an image by 
edge detection opeiatois These edges mark the image locations of discontinu 
ities in gioy level coloi, textuie etc The most common pioblem that aiiscs 
111 many edge detection technique is that the edges that ue detected are false 
At some tunc s dc t( ( ted edges often have gajis between them at loc itioiis where 
(he tidiisition between regions aie not abiupt enough So detected edges may 
not neccessaiilj foim a set of closed connected euives that soiiound eonneeted 
legions 

3 Region based In these methods, image segment ition is earned out dividing 
the linage into diffeient legions The different methods are region merging, 
legion splitting and split ind irieigc One of tin drawbacks with the legioii 
e\ti action method is that it processes the image in an iteiative manner and 
usually requires large computational time and memoiy 

Recently morphological based watershed algoiithm has been developed for 
image segmentation Malhc malieal inorpiiology is indeed Utiaclivefor ihesegmena 
tion puipose because it is a geometrical appioach to signal processing and easily deals 
with critena such as sh ipe, si/e, contiast, or eonuecLivily that can be con&ideied as 
segmenation oriented features [4] Morphological appioach to image segmentation 
combines icgiou gi owing xnd edge detection teihmques It gioups the imige pixels 
around the region minima of the image and the boundaiies of adjacent groupings 
are precisely located along the crest lines of the gradient image This is achieved by 
a transformation called watei shed ti ansf ormaUon Watershed Transformation is 
a powerful tool foi image segmentation winch can potentially provide accurate seg 
mentation with very low computational cost as campxred segmentation algorithms 
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For rlie low bit Rate \idco codecs using segmentation based methods, accurate seg 
menation of a frame is an important lequiremeiit In the proposed video codec design 
we used the hloiphologic il b ised W atci s}icd Scg7iic7]tatiun ilgorithm[2] 

Foi iinigt segmentation watershed tiansfoimalion starts with the gradient 
of the image to be segmented It views the gradient image as a three dimensional 
(3 D) suiface wlicie the gradient values act as sin face heights Intensity edges m 
the image to lx scginciiti d g( rui illy luv( high gi idient v dues which will ippcai as 
watershed lines on the 3 D suiface while the iiiteiior of each region usuilly has a 
lowgi ulitiit value wliidi is coiisuh ud is x (afclivirnt hasznonthe3 Dsinfue The 
wateishcd lines partition the gradient image into different catchment basins which 
correspond to homogenous regions of image to be segmented Wateished tiaiisfoi 
mation involves a search for wateished lines in the gradient image Theiefore, the 
perfoiinance of a watershed based image segmentation method largely depends on 
the algorithm used to compute the gradient 

Comentional gradient algoiithms exhibit a serious weakness for watershed 
based image segmentation A conventional gradient opeiator such as the first order 
partial derivative of Gaussian filter and morphological gradient operators produce 
too many local imiiima because of noise and quaiitiaatioii error with in homogenous 
legions Each miniinua of the gradient introduces a catchment basin of the watershed 
transformation As a result, these gradient operators result in over segmentation 
le homogenous regions are partitioned into a large number of regions and proper 
eoritoiiis are lost 111 i mullitiuh oflilsi oiks 

We choose Multisc ik giidunt ilgoiithm [3] based on moiphological oper 
ators to reduce the oversegmentation This algorithm efficiently enhances blurred 
edges such that their gradient values increase above those caused by noise and quan 
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tization erioi llien by using an algoiithrn to elunmate the local nimima and giving 
this multiscale giadient image as input to the watcished transform We got good 
usults 



Figuie 3 1 (a) Minima, catchment basins and wateisheds (b) Building dams at the 
places here the water coming from two catchment basins would merge 

3 1 Morphological tools of Interest 

In this section we briefly desciibe some morphological tools of mteiest for the total 
segmentation algoiithm 

Dilation Let f(x,y) denote input image and b(x,y) the structuring element The 
grey scale dilation of f by b, denoted by / © 6, is defined as 

(/ © b){s,t) = max{f{s - x, t - y) + b{x,y)\{s - x),{t ~ y) e Dj,{x,y) G Db} 

Erosion If f(x,y) is the input image and b(x,y) a structuring element, then Grey 
scale Erosion of f by b, denoted by / © 6, is defined as 


IG 



(/ © = vnn{f{s + x,t + y) - b{i,y)\{b + /) {t + y) e Dj, {x y) e D(,} 


Geodesic dilation A geodesic dilation involves two images a maikei image and a 
mask image Let / denote the marker image and g the mask image such that f < g 
The geodesic dil ition of size 1 of the maikei image f with respect to the mask image 
g is denoted by and is defined as the point wise minimum between the mask 

image and the elemcntaiy dilation of the marker image It can be expressed as 

= '*“’(/) ^ 3 

Geodesic erosion The geodesic ciosion is the dual tiaiisfoimatioii of the geodesic 
dilation wih icspcct to set coniplerneiitation 

4‘H/) = V g 

where f > g and is the elementary eiosion Hence, the marker image is first 
eroded followed by the point wise maximum with the mask image 
Reconstruction by erosion The icconsliuction by (losioii of a mask iiruge g 
fiom a marker image f {f > g) is defined as the geodesic erosion of / with respect 
to g until stability is i cached It is denoted by Hg{f) 

Rl(f) = 4'H/) 

where i is such that eW = 

Geodesic Distance Id A C 2^ Ik a sc t wlucli is the giay scale image and 
domain Da, x, and y are two points of A The geodesic distance dA{x,y) between 
two pixels X and y in A is the infmiuin of the length of the paths which join x and 
y and are totally included in A 
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P path bet\\een x and y which is totally incudcd in ^4} 


Geodesic Influence Zone Suppose the set A contains a set B made of several 
connected components D 2 Bk The geodesic influence zone of a con 

nected component B o{ B in A is the locus of the points of A whose geodesic 
distance to 5, is smaller than their geodesic distance to any other component of B 

^za{B,) = {p € e [l,A]/{2},c/a(rhA) < 



Figuic 3 2 Geodesic influence 7one of connected component By inside set A 


3 2 Multiscale Gradient Algorithm 

Ideal step edges do not exist in natuial images since eveiy edge is blurred to some 
extent A blurred edge can be modelled by a ramp and the intensity change between 
two sides of the edge is rcfciied to as edge height For a lamp edge, the output of 
a conventional gradient opeiatoi is the slope of the edge as shown in figure 3 3 
Hence, the ramp edge cannot be separated from noise and quantization erior by 
thicsholdmg, if the slope of the edge is small So a suitable gradient operator for 
wateished transformation is chosen as the one whose output is equal to the input 
edge The general morphological gradient operator is given by 

G(/) = (fe B) - (f e B) (3 1) 
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Where © and © denote dilation and eiosion opeiatms respectively and B is called 
stiuctunng element This giadient opeiator is lefened to as moiioscale gradient 
opeiatoi Its peilormance depends on the s ze of the structuring element B If B 
IS laige the output of this gradient operatoi foi ramp edge is equal to the edge 
height But laige structuiing elements result in serious inteiaction among edges 
which may lead to gradient maxima not coinciding with the edges However if the 
structuring element is \eiy small this gradient oper itoi has a high spatial resolution, 
but produces a low output value for ramp edges In older to exploit the advantages 
of both large and small striictuiing elements, a niiiltiscale moiphological giadient 
algorithm is proposed [3] 




Piguie 3 3 Output of conventional giadient operators 
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Let B foi 0 < 2 < 71, denote a gioup of square structuring elements The 
size of B, IS (2z + 1) x (2z + 1) pixels Then Moiphological multiscale gradient is 
defined 

^ EKU ® B.) - (/ e B )) e B -,| { 32 ) 

For a step edge the operation ({/ © B,) — (/ 0 B^)) © Bi_i produces a 
line of two pixel wide which coincides with the edge and intensity (height) of the 
line IS equal to the edge height lienee the niultiscile giadient is equivalent to a 
monoscale giadient opeiator in this case, but is moie lobust to noise due to the 
averaging operation used in the ilgonthm Foi lamp edge of width w and height 
/i, the opention ((/ © B,) - (/ 0 B,)) O Bi_i piodiucs i line coineiding with the 
edge The eioss section of the line appeus as a tiapczoid foi small i and as a 
triangle otherwise[3] The width of the bottom side of the tiapezoids or triangles is 
always equal to re + 2 pixels with heights always gi eater than the edge slope h/w 
[3] Theiefore, the multiscale gradient responds effectively to ramp edges without 
enlarging edges 

3 3 Elimination of small local minima 

A small local minima is defined as that local minima consisting of a small number 
of pixels or having a low contrast with its neighbours This kind of local minima 
m gradient images is gonoially r uistd by noisr oi quanli/alion ciioi, and thucfoie 
should be eliminated sueh loeal minima aie eliminated by dilation with a squaie 
structuimg element JS, of 2 x 2 jiixels eU noted by (MG(/)) ® Bg [3], and adding a 
constant h to the dilated gradient image The local mimmas with a contrast lower 
than /? can then be filled up using the reconstruction by erosion of MG{f) from 
{{MG{f)) © Bg + h) Consequently the final gradient image can be expressed os 
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Figuie 3 4 Elimination of small local miimna 


3 4 Watershed Alogorithm 

Ihe coiiupt of wdicrnhcdn and (aldnncnt banuH aio wdl known in topogiapliy 
The Noith American Continental divide is a text book example of a watershed line 
with catchment bisiris foimcd by the Atlanta and Pacific Oceans In the fcild of 
Image processing, giaj scale images aie often considered as topographic reliefs In 
the topographic representation of a given image the numerical value of each pixel 
stmds loi the elcntion at this point 
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Let us considei the topoyapluc representation of a grey level image Let a 
diop of w ltd fall on sueh a topogr iphic surface Aceoiding to the law of gra/itation 
it will How down ilong the steepest slope path until it leaehcs a mmimmn ihe 
entire set of points on the suiface whose steepest slope paths reach a given minimum 
constitutes the catchment basin associated with this minimum The water sheds 
aie the zones dividing idj leent eatchnient bisms 


3 4 1 Some Definition in teims of flooding simulations 

The definition of wateisheds in terms of water flow is not suitable for well- suited 
to an algorithmic implement ition as theie me many eases whcie the flow diiection 
at a given point is not detcirnmable (e g flat legions ele) For digital functions, there 
exists no rule to set up the path a drop of water would follow [2] Hence a definition 
mterms of flooding siinulitions that alleviates these pioblcms is used 

Consider again the grey tone image as a topogiaphic surface and assume 
that holes have been punched in each legional mmiina of the suiface The suiface is 
then slowly immersed into a lake Starting fiom the minima at the lowest altitude, 
the water will progiessively flood the catchment basins of the image In addition, 
dams are erected at the places where the waters coming from two different minima 
would merge At the end of this flooding piocedme, each minima is completely 
surrounded by dams, which delineate its associated catchment basin The resulting 
dams coiiespondb to tlu witei sheds ihey provide us willi a putition of the input 
image into its different catchment basins 

To simulate this immersion procedure, we start fiom the set Th^ (/), the 
points of which being the first to be leach by the water These points constitute the 
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starting set of our recursion We thus put 


A,, =D„ (/) (3 3) 

\/, IS in i(l( of tin points of I wliuli belong to the imiiini x of lowest 
altitude Let us noxx considei the threshold of I at level + 1 i e , Ty^ „+i(/) 
Obviously \h^ C 'lf^ +i{I) Now Y being one of the connected components of 
//i +i(/), thcie aip three possible inclusion relations between Y and Y fl „ 


1 Y n A/i =0 In this ease, Y is obviously a nexv ininimuin of I 

2 y n \/i / 0 and is connected m this eisc, Y eoiicsponds (\actly 

to the pixels belonging to the catchment basin associated with the minimum 
y O A/i^ and liixing gray level loxvei oi cciual to hmtn + 1 

y = ft„ (34) 


3)0 \h„ ^ 0 and is not connected xvc notice th it Y contains different 

minima of I Denote by Zi,Z 2 ,Zz, ,Zk these minima, and let Zi be one of 
them At this point, the best possible choice for is given by the 

geodesic influence zone of Z^ inside Y 


Ch^ +\{Zi) — izy{Zi] 


(3 5) 
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Since dll the possibilities ha\e been dicussed, we take the second set of oui 
Kcursion as follows 


\n +1 =mmn^ U IZj, +,(/)( \/> ) (3 6) 

This relation holds for all levels of h So the set of catchment basins of the gray 
scale image I is equal to the set A/,„ obtained from the following lecursion 

6) V /i. G l^max 1]) -^/t+1 ~ mZ72/,+ i 

The wateisheds of I coiiespond to the set of points of £>/, which do not belong to 
any catchment b ism 


Xt, 1 



Figuie 3 6 Recursion relation between Xh and Xh+i 


3 4 2 Implementation of immersion algorithm 

Implementation of immeision algorithm is divided into two paits 

Sorting In order to have a dinct access to pixels at i given level, the fust step 
consists in an initial sorting of the pixels in the iiici easing ordei of their giey values 
Flooding Once the pixels li we been soitcd, we piocccd to the progicssivc Hooding 
of the catchment basins of the image Suppose the flooding has been done upto a 
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given level li Evci} cdtdiment basin already discovered i e , every catchment basin 
whose coiiespondmg minima has an altitude lowci or equal to h is supposed to 
have a unique label Because of the initul soitmg we can now access the pixels 
of iltitudc li+l diiectlj ind given them i special value say MASK Those pixels 
among them which have m ihcady libelled pixel as one of their iieighbouis aie 
put into the queue Staitmg from these pixels the queue structure enables one to 
extend the labelled catchment basins inside the mask of pixels having value MASK, 
by computing geodesic influence zones Alter tins step, only the minima at level 
liH-l have not been reached Indeed they aie not connected to any of the labeled 
catchment basins Iheiefoie a second sc iniimg of the pixels at level h+1 is necessary 
to detect the pixels which still have i value MASK, and to give a new libel (o the 
discovered catchment basins The process continues until the highest gray level 
IS scanned Only the pixels which are cxictly ”hilf way between” two catchment 
basins are treated as watershed pixels The algorithm for fast watersheds is given 
in Appendix A 

In this way a meaningful segmentation of the fiaine is achieved At the 
end of this piocess we have the information about the number of regions identified 
and pixel label Ihis mfoimatioii will be necessary throughout the coding process 
because we’ll be dealing with the motion of segmented objects(regions) rathai than 
individual pixels 

Figures 3 7 show the result of seginoiitation of a particular frame in CLAIRE 
sequence and 6yl7L5A/A A sequence icspcctivcly 
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(a) (b) 

Figure 3 7 Segmoutation results for a) CLAIRE midge and b) SALESMAN image 
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Chapter 4 


MOTION ESTIMATION 


Motion aiilysis is an inipoitant step in image analysis The three main existing tech 
niqucb foi motion cstim ition can be categorised is fcituie based methods, that 
extract image features and tiack these features from frame to fiame, secondly the 
gradient based methods tint use spitnl and tempoial paitial deiivativcs to csti 
mate image flow at each location m the image and thirdly, the correlation based 
methods that use siinilaiitj m biightness patterns to determine motion vectors We 
choose the feature based appioach as offers several advdntages[7] Firstly, feature 
extraction reduces the last amount of data present in an image, without necessarily 
eliminating salient mfonnation It can use either low level diseriptors or high level 
discnptors as basic matching elements Low level approach is generally preferred 
for Its generality and graceful degradation [7] Low level discnptors may be classi 
fled as either region based oi edge based oi point b iscd Ihe point features are most 
suitable as matching tokens In the motion estimation algorithm developed in this 
ehaptei we choose coiiiei points as bisie mitehing elements 
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4 1 Moving Region Selection 


Moving regions are selected from the segmented regions of the previous de 
coded pictuie and the curient input pictme The regions in which the percentage of 
pixels which aie diffeient between the previous and the curient frame, is larger than 
a thieshold [MOV THset to 5 in oui case] aic niaiked as moving regions through 
a flag which is set to one A plot of the number of moving regions , as identified 
frame wise is shown in figure 6 4 


4 2 Cornel Detection 

After the selection of the moving legions, We find the coirioi points in the curient 
frame Corner point extiaction is the bisic step for motion estimation Figure 4 1 
gives the total blotk dngi uii foi Motion (sliiii ilioii A (oiiiti point is ddiiied as a 
point where, both the image cuivaturc iiid the gi idunt peipcndiculai to the edge, 
must have a local maxima and each exceed a spotihc thieshold Ihis is known as 
the Wang Brady criterion for corner detection [7] 

The conditions for corner detection are as follows 

C = ” ‘5'|| V/|p — > max , where C is the curvature 

G>R 


The four parameteis used m the above ciiteiiori can be understood as fol- 
lows 

S', a measuic of the surface curvature , 
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R a threshold for the coiner response 
E, an cdf^e strength measure 

AI defining the \\indo\\ si/o o\ii which the local in ixiina should be obt lined 
The values for these pai mieteis in oiu simulation are 

jS = 0 4, jR = le5, = 5 (i e 5x5 window) 

Pi 101 to qiplying Ihc above ciitei i for coiiici del icLioii, it is iinpoitaiiL to 
apply a smoothing filter over the image 1 his reduces noise reduction and suppresses 
of false comers Wc liivi used x Cxussixn window of MEAN = 0, and vaiiance 
SIGMA =1, foi smoothing The lesults of coiner point extraction for various images 
are shown in hguie G 5 


4 3 Corner matching and Tracking 

After coinei detection, corner matching and tracking an important step in the 
estimation of motion paianieters is earned out Oui objective is to match the corner 
points in two consecutive le the current and the previous frame, such that they 
can be used for motion estimation Suppose the miteher lecieves as input two giey 
scale images, Ji and I 2 , along with thier respective corners and respectively 
(where 1 = 0,1,2, n 1 and j= 0,1 ,2, n 1) Its task is to match Xi to X'^ This is 
a the well known correspondence problem , with worst case enumeration as nn 
possible matches Coiisti iints Iiom physic il woild iie usually iiiiposed to leduee 
the computational load They are 

• Each coinei may pair with only one other corner 

• Small seal eh neighbouihoods aie utilised since corners do not move for between 
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consecutive fidinc&(thc In^li fi uuc iite ensuics small mter Irame motion) 

• Local imaj^c pitch con elation is employed to vciify matches 

To match a coinei point A, m Jj all the corner points in I 2 lying in the 
seaicli window centcied at the position of Aj arc candidates for the match Correia 
tioii lb then pcifoimcd between the imigc ugion soiioundiiig the coiner At iiid the 
linage region sorioundmg each candidate corner in I 2 The winning candidate is one 
with the highest coiielation value Cmax, provided Cmax exceeds a specific threshold 
lo coirelate an image region T with an image legion P, each of size (W x W), the 
coiielation measuie is given by, 


„ - EtE , p.u,h 

\/ir^ n E, 1 i+k ]+l Pi+k j+l)^ 

(4 1) 

where (t, j) indexes the centre of the pixel block and size W = 2w+l pixel 
wide and P, j aie the respective intensity values, and k and I range fioin —w 
to +tu inclusive 

The inatcliing piocccds is follows 

Each coinei point , while it lives, has the following information 

a) Curieiit position , and positions in the previous two frames 

b) Frame numbers , telling when the point fust occuied and last occuied 

c) type of point ( 1 e , sliong match, Weak match 01 No match, fiom previous 
liainc) 

d) confidciKe of nii((h(i c , (oruhtion with the picvious fiamo) lliis is icfciicd 
to as corner history m the block diagram of Motion estimation, figure 4 1 
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• &TDP1 \ssuming the motion of the comer point is restnctorl to a 8 x 8 
window, we find out those corner points in the previous framc(with sorrouiidmg 
legion of si/e '") X 5) wliieli hive i eoiielilion with the eoiiui points in the 
cuiient fidiiie highei than a threshold M IN JJORR[= 0 6 in our case) 

• STEP2 Now icpeat step 1, foi eoinci points flora the cuirent frame to those 
111 the pievious fiariie 

• Si DPS All those points, which h ivc a two w inateh aic labelled as STRONG 
nuUhes and ue sloied m a list of eomer iioints ilong with til the leciiuied 
iiilorniatioii( see ibove ) llie points ui euiiont tiaiuc, which have no match 
in the prcMous frame are labelled is New eornci points and stoied in the list 
The points in pievious fiaine, which have no match in the curient fiaine, can 
be classified as eitliei vanished foi a short time, or permanently disappeared 
To piocess these, we go to stop 4 

• STEP4 If we assume that the point in pievious frame has vanished only in 
the current fiame, then we can predict its motion from the last two values 
and use tint to find a pseudo eoiiiei point foi it m the euiuiit hame, le 

I 

based on piediction, we tiy to locate a point in the current frame whuh his a 
correlation value liighci than a threshold with the eornei point in the previous 
frame We choose a predictor based on constant velocity model The finite 
dilferenee form foi tins predietoi is 
A(Ad 1) = 2X[k) - X{k-1) 

If such a pseudo coinei point is found, then it is labelled as WEAK match 
and stored in the list If no such point can be found for the corner, then it is 
labelled as "disappeared” If the point has lemamed disappeared over the last 
two flames then it is assumed "dead” and is deleted from the list of corner 
points 
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The abo\e tiacking of corner points is done for each new frame It results, at 
any particular instant m a set of corner points of various types The number 
of cornel points increases for the first few frames, but then stabilizes, as the 
death rate of the points catches up with the bath late 


4 4 Motion Estimation and Prediction 


We emploj a 3 D model for motion estimation and prediction It is based 
on the following hypothesis 

1 Moving regions foim a plane iii the image plane 

2 The motion is KpicsenUd by i 3 D liiicu ti insloiiii ition 

3 The projection from 3 D space to 2 D image plane is a cential projection 
Each segmented legion can be lepresented as a planar object given by 

axd by+c/ =1 

Any kind of motion can be represented by a combination of a lotation vector R and 
a tianslatioii vectoi V 
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Ihe relation between 3 D coordinates {x,y,z) and 2 D coordinates 
IS described by central pi ejection as 

A = Ff Y = 

Z 

where F is the distance from the centre of projection to the image plane 
Using the above expressions , the motion on the image plane (X Y) -)• {X' ,Y') is 
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desciibed as 




where, 


aiA+a;>l +a.a (M \+a.,l +ar, ] 
ayX+agK+l ’ ayX+agl'^+l / ’ 



ai = 


R^ 1 +qT 1 
■R33+CU 


^2 


R^ 2 +bV^ _ (i^i3+cVi)F 

-R33+CV3 3 H33+CV3 


a 4 


a? 


-^21+0^2 ^ _ R 22 +bV 2 _ {R 23 -^cV 2 )F 

•R 33 +CV 3 5 J^J 22 _|_cV 3 6 

.J? 31 +aT 3 _ -R32+6K3 

(/i!33+cV3)F 8 (i?33+cV3)F 


The eight variables oi ~ 03 are motion parameters Here (AT, Y) indicate a 
source position in the cuirent frame, and (\ ,Y ) rcfcis to a souice position in the 
previous frame The eight paiaineter set (ci - os) defines a motion parameter and 
clidiactcii/(s the motion of 1 icgioii 


4 4 1 Motion Estimation 

We need to cstimite the motion iiaiimeteis for exch rtgioii, fiom the previous 
decoded frame and the curient fiame This is done by using a least square fit of the 
representative points of the region, m the two fiaines Each region, at any instant, 
IS represented by its coiner points(assuming a rigid 3 D body) Hence these points 
are used for the least square fit 

First of all, we estimate the motion for only those regions which have been 
previously dcchicd ns moving regions This saves a lot of computation especially in 
the e^isc of an head xiid sliouldei im xgc as both display the same motion, and lest of 
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the back giound lem iins static Next, if a region has many corner points, then we 
put 1 maxirnuin bound on tlie number of points to be used for motion estimation, 
and moreover, choose only the points with maximum confidance values (piioiity is 
given to STRONG matches) 


Ihc block ch igi un of the motion csiiiiiLioii pioccss is shown iii big 4 1 

Now that we have got our points for each moving region, we estimate the motion 

MO 1 ION ES 1 IMA 1 ION 



Figure 4 1 Block diagram for Motion Estimation 


paramcteis (given above) as follows 


Let \,Y iiid \ ,4 repicseiit the pixel position, m the previous and the 
current fiame respectively Then the two coordinate sets are related by Eq(4 2) 
The erioi function, at a pixel, is defined as 


e 


2 

t 



UiAt + 02^1 + <13] 
ttyXi + ClsYi + 1 


+ 




a^Xi + ci^Yi + ttel ^ 

ayX, + asY, + l. 


(4 3) 


The cumulative Erior function is given by 


34 








^ — £r=0^i(^7^i +<78^1 + 1)*^ 


Heie the mdnidnal pixel errois have been multiplied by weights le the 
denoiniiidtor teim of Eq (4 3) because it enables us to conveit the non linear equa- 
tion to a line 11 one thus permitting a smiplu solution We hive assumed that these 
weights not laiy much in a segment such that thr solution will be close enough to 
the optimum least squaie one 

Let L be the niatiix 


L 


\i El 1 0 0 0 -A(yYi 

0 0 0 X, El 1 -r/Xi -y/Ei 


0 0 0 -x'Xn 

x„ y„ 1 -y^y„ 

0-2 Os O 4 O 5 06 (X^ Og 

n a; y' 

Then the eiror function E can be expiessed in terms of these matrices as follows 

E = Ilia - D\f 

In Older to minimize I? as a function of ^ , we compute the Pseudo Inveise of L , 
which IS dchnod is lollows 

If the number of points is greater than 3 , then the Pseudo Inverse is 

L+ = {L'^L)-n'^ 

and lie get a unique solution gueii by 


Xn Pn 1 
0 0 0 


and A and B be defined as below 




Oi 
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4 = L+B 


Otherwise if the numbei of jioints is less tlian 4 then Pseudo Inverse is 

L+ = 

In this case, multiple solutions exist and the iinininuui noini solution is given by 

A = L+B 

Since, in this eise we would like the motion ji u umkis to be closest to the ” No 
Motion ’ ease i o rii = n, — 1 ind icst /eio, so vc siibstitulo a\ = fii — 1 and 
= <^5 ~ 1 Hence L remains same and is replaced by 

Yl-Y, x:,-Xn F'-ru 
In this way we calculate oi ~ for each moving region 

4 4 2 Motion compensated Prediction 

Ihe predicted pietuie is constructed fiom Eq (4 2) using the estimated motion 
parameteis for cieli ugioii Bieiuse the pixel loc ilioii cileiilaUd fiom above is 
fractionally precise, the pixel value located between sample points is calculated by 
bihncai interpolation 

Uncovered and oveilapped legions aie eiused because the picdietioii is 
based on regions segmented at the previous picture The displacement vectors for the 
pixels m these regions are interpolated from adjacent motion parameters according 
to the distance fiom the region boundaues[l] The difference between the current 
input frame and piedicted frame is called prediction error Coding of prediction 
enor is explained m next chaptei 
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Chapter 5 


PREDICTION ERROR CODING 


5 1 Intioduction 

Prediction erroi is defined as the difference between the current input frame and a 
predicted frame Refeiring to the encoder given in Figure 2 1, it is observed that 
this error typically arises in a head and sliouldci image because of the following 
reasons 

1 The motion around the regions like ejes, mouth etc in a typical head and slioul 
der image cannot be sufficiently represented by motion parameters Consequently 
errors may be generated m these regions 

2 Large errors may be generated around the segmented region boundaries due to 
regions uncovered by the motion compensated piedictioii and inaccuracies in the 
motion parametei estimation 

So most of the predicted error is concentrated only around the face and boundaries 
of moving regions Taking caie of these errois is very important for exact segmen- 
tation of the next fiame and faithful reconstiuction of the original frame at the 
decoder 
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Coding the enoi imigc bj st uidard methods like JPEG is not suitable for 
very low bit late applications because this error nifoiniatioii itself requires 48kbits 
loi its icpicseiitatioii Since most of the eiior is coiicontiatcd uouiid the eyes and 
mouth and the boundanes of moving regions, there is no need to code the total 
error image So wo idoptod a new approach to encode the eiror picture by taking 
advantage of the fict that only few 8x8 blocks of the eiioi imigc have significant 
ciror and the icsl of the blocks iic ciioi ficc 


5 2 Method used 

Prediction erroi coding algonthin hist divides the total eiior image into 8x8 blocks 
After that it operates on a block by block basis In the present approach we are not 
coding e^ ery 8x8 block in the error image The significant blocks m which the total 
absolute error is greatei than a threshold (say Tl) are counted as significant error 
blocks and coded 

A two dimensional DCT is peiformed on each significant eiror block The 
magnitude of each DCT coefficient indicates the contribution of a particular com 
bmation of horizontal and vertical spatial frequencies to the significant error block 
The coefficient corresponding to zero horizontal and vertical frequency is called DC 
coefficient The DCT output data is an 8 x 8 block of transform coefficients with 
the DC teiin at the top upper left coiner The other 63 coefficients are AC terms 

DCT does not directly reduce the number of bits required to represent the 
significant error block For an 8 x 8 block of pixels, DCT produces an 8 x 8 block 
of coefficients llie reduction m the number of bits follows from the observation 
that, for typical blocks the distribution of coefficients is non uniform The transform 
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tends to concentiate the energy into the low frequency coefficients and many of other 
coefficients aie near ?eio The bit rate reduction is achieved by not considering the 
lieu /eio coefiieientb ind by quiiitizmg and coding the leniaining coefficients as 
dc&ciibed below llie non uniform coefficient distribution is a result of the spatial 
ledundancy piescnt in the original image block 

By cxpenmeiitition it is found that a threshold of Tl=800 is a reasonable 
value to select sigiiiiu uit eiioi blocks in ui ciiot iinigo llie libel Block-Run is 
used to ropioscnt the number of blocks passed before eiicounteimg the significant 
error block As only sigmfieuit enoi blocks nc cock cl, it ic duces the computational 
time as well is the total number of bits needed to encode This is very important 
for a low bit late coder If the number of significant error blocks exceeds a threshold 
T2 {20 in our case) then there is a definite need to send an intra frame In that case 
we send the total image coded using standard JPEG method 

5 2 1 Quantizing Scheme 

Tor quantization, we use the standard JPEG quantizing scheme There is always a 
trade off between pictuie equality and degree of cjuantuatioii A laige quantization 
step size can pioduce unacceptably laige image distortion Unfortunately, finer 
quantization leads to lowei compression ratios The question arises how best one 
can quantize the DCT coefficients most efficiently As human visual system response 
has natural higli liccjiiciicy loll off, these licquciicics play a less iinpoitaiit lolc than 
low fiequencies This allows one to use a high quantization step size for the high 
frequency coefficients, with little noticeable image distoition 

The quantization matrix is a 8 x 8 matrix of different step sizes, correspond- 
ing to each DCT coelficicnt Step sizes will be smaller in the upper left part(low 
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ficquciicies) and large in the upper right part (high fiequencies) The quantizer 
divides the DC2 coefficients bj its coric&pondmg quantum then rounds 't to the 
lie xicst nitegci L ngc qu uitunis drive sin ill coeffic lents to zero T he result is many 
high ficqucncy coefficients become /eio and theiofoie casiei to code The low fre 
quency coefficients undeigo only minor adjustment 

5 2 2 Compiession Technique 

As mentioned ( uliei, m the piesent iiutliod we me not toding eaeh and every 
block 111 the error pictuie We are coding only significant error blocks which have 
sufficient erioi Ihe number of blocks ( without sufficient error ) passed before 
cncouiiteimg the significant error block is the Block Run So each significant error 
block IS associated w ith a Block Run Subsequently we apply DCT and quantization 
to each significant error block followed by ZigJZag scanning of the quantized DCT 
coefficients and coding 

Zigzag Scanning 

After the quantization of each significant error block, it is not unusual for more than 
half of the DCT coefficients to be equal to zero Since most of the non zero DCT 
coefficients will typically be concentrated in the upper left hand corner of the matrix, 
it is apparent that a zig/ag scarming pattern will tend to maximize the probability 
of achieving long runs of consecutive zero cocfliueiits Zigzag scaiming pattern is 
shown m the figuie 6 10 Phis ordering puts the lowest fieqiiency coefficients first 
followed by the high frequency coefficients which typically have close to zero values 
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Encoding 


Lack sigiuhcdiit tnoi block is a&sociited with a BlockJlua as peviously mentioned 
lliib BlockJlun lb umtoiraly quantized and encoded The dc and ac terms in each 
ciioi blocl aie then sepu itely coded as in standaid JPEG coding 

Blocl -Run IS unifoimly quantized and encoded using fixed length coding 
We allocated 8 bits to represent this information As regards the DC coefficient 
encoding, the absolute values of dc coefficients are not directly coded This is be 
c xuse the DC coelhcicnts of neighbouiing blocks typically do not vary Consequently 
fewer bits xic needed to encode the dilferences between the successive DC coefficients 
rathei than the absolute values So the diflerence between the DC coefficients is en 
coded DC coefficient difference is encoded into two parts, some what like exponent 
and mantissa One pait is the mxgnitude extegory Ihe magnitude category M of 
an inlcgcr N is the smxllest intcgci M such tint 
2^ > INI 

The magnitude category M is Huffman coded and M bits appended to it to identify 
a particular inenibei of the category For example if the difference is 5, it is in 
category 3, which comprises the set { 7, 6, 5, 4, 4, 5, 6, 7} Now this category 3 
IS Huffman coded as 100 using standard JPEG dc coding tables and then additional 
3 bits le 101 arc appended to it to represent a paticular number in the category So 
total code is 100101 to icpresent the DC coefficient 

After coding the dc difference, the xlgoiithm codes the bloek quantized AC 
coefiieunts ilu (luaiiliztd at (odlKKuls an /ig/ag st iiined to loini a sequence of 
[Run, Size) pairs Beginning at the upper left hand corner we scan the coefficients 
m zigzag fashion The numbci of zcio values passed before encountering the non 
zero \aluc is the Run count and the nonzero value is the Level The Size is the 
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magnitude category of Level So the magnitude categoiy/Szze of integer Level is 
the smillest nitegoi Sip’c sucli that 

> \Ltvd\ 

Now this {Bun, Si c) pin is IIufFinaii coded using st indard IPCG ac coding tables 
and then "Size number’ of bits are appended to identify the particular member of 
the ludgiiitude eitcgoiy Size Tins proeedme is repeated foi all the significant enor 
blocks m the erioi image 

St ind lid TPEC coding takes appioximatoly SOKbits to code the entire error 
image lollowing the impioved method of coding only the signihcant eiioi blocks 
we achieved a compiession of about 50 1 which is very important for low bit rate 
coding 
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Chapter 6 


RESULTS AND DISCUSSION 


hi tins ch iptci cviluili the pciloinidiicc of the Region based Low Bit Rate 
Video Codoe d( \oloped 1 he peitonnanee is tested using standard sequences such as 
l)Chne sequence 2) Siksnnn uid also with i sequence taken m tlic lab uid titled 
as the ” Author se eiue iiee^ ’ 1 ho si/o of the niiagos use d in all the sequoiiecs were 256 x 
250 with a frame i etc of 7 5 fiaines/see The ilgoiithin developed has sueeesslully 
reconstructed the fust 10 frames of Claire, Salesman and Author sequences The 
original and reconstructed frames are shown in figures 6 11 to 6 40 

As discussed m the previous chapters, the low bit rate video codec essentially 
consists of the follerw mg stages 

• Segmentation 

• Motion e St iiiul ion 

• Coding 

Using (Ik s( iiiel iiel seeiiuiiees, we hive been ible (o sue eessfnlly se giiie n( each 
frame, identify eoinei points, and track them over the frames Using the corner 
points, we cxtiaet motion iiifoimition and predict the next frame The predicted 
fi line IS then used to c ilculite the prediction error, which is then effieiently coded 
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At the eUcodci \\( \ist tlusciioi uul the niotiejii uiloitn itioii Lo pe iloun llie lecon 
stale Liou ol the emiciit ti ime 

The' ATiious stejis imohccl -irc biieflj di&cusseel fiom the irnpleinentation 
point of view iiiel the lesults obtauieci “iie shown for i paitieular fiaine of cacli 
scejueiicc 

6 1 Segmentation 

llu Wateisheel li insloinulion is cxpliine'd ni clnptci 3 his been nnpltiiicntcd for 
linage segment itioii The chectivoiicss of the image segmentation iiictliod based on 
watcished tiansfoinntion is limited by the quality of the gradient image Watershed 
ti mfoun bised on comentioinl gi idient opciatois results m over segmentation 
Bee arise of this e leli homogeneous legion is paititioiied into a large number of regions 
uid piupoi eoiitoms aie lost in a multitude of false ones This can be observed fiom 
the segment ition icsults of the Clan e and silesman images obtained using watershed 
algonthm bised on comentional moiphological giadient opeiatoi, shown m figures 
G 1 and G 2 



(a) CLAIRE Inngc (b) Over Segmented CLAIRE 

rigiiu G 1 O\oi Segmentation Due to Conventional Gradient Operator 
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(i) SAIESMAN hmgc (b) Ovei Segmented SALESMAN 

iMguro 6 2 Waltishcd Segmentation uith Conventional Giadient Operator 


Ihe o\ei segmentation is due to the local minima caused by noise and quan 
ti/ation error Bj thresholding, it is not possible to eliminate these local minima 
while inescrving those prodmed b} blrnird edges Multi scale gradient algorithm 
discussed in section 3 2, has been employed to alleviate this problem by efficiently 
enhancing blurred edges lliis enhancement mere ises the gradient value for blurred 
edges above those caused bj noise and quantization error Consequently the local 
mirnraa produced by noise and qumtizalion error are eliminated using the algoritm 
discussed in section 3 3 In tins procedure, a constant ’h’ is added to the dilated 
gradicnl innge iiicl local minima with i lonli ist lowci thin li is lillecl using the 
reconstruction by ciosion opeiation as discussed m section 3 3 flie paiameter ’h’ 
contiols the numbei of segmented icgions As ’h’ mcieiscs, the numbci of regions 
pioduced dccreiscs Figure 6 3 shows the effect of varying ’h’ on the segmentation 
process 
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(c) Segmentation ^vilh li=25 (d) Segmentation with h=40 

Figni c 6 3 Varntion of Segmentation with parameter ’h’ In the Multiscale Gradient 
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Wxt(is]u(l ti xii&foiuiation usinj, IMultiscxlo {^ladient algorithm results m a 
incunngful segment itioii of u fnine without rcquiiing any further region merging 
lias IS obstned horn the following segiiKiit it loii usults ol CLAIRE and SALES 
MAN nil igt shown m fig G 1 





6 2 Motion Estimation 


riic Motion (Stun ilion pioccdme explained in chdpterl his been implemented Af 
1( 1 s( piiK III It ion nioviiif, k pious no soh c Ud lioin tin sc pine nted icpions explained 
in section 1 1 A plot of the luinibci of iiioviiip u pious is idcntihccl fiiitio wise is 
shown in fig G 5 



I iguic 0 5 Nunibci of hloving Regions against Fiame Number 


Aftci seleetioii of moving icgions eorner points aic identified in each frame 
The eorner point extraetion method explained m section 4 2 is implemented The 
foul parameters to bo specified m the coriici detection ircS RE and M The mask 
sue M controls the extent over which the cornei position must be a local peak and 
indiK c t ly dc hiu s ( lu iiiiiuinum sc p u ilnlK y be twee n c oiiieis 1 he 1 ugcr m isk lejects 
the high ficquency intensity variations (noise) and reduces clutter A mask size of 5 
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piovcs sitisiitton for our upphcution The parameter S supre&ses the corner corner 
icsi)ons( ilon[^ b> suhtiuting i fi u tion of tin edge sticngth luciei&ing 

S beyond 0 3 0 1 Ins little effect on edge lesiionse suppiession, since my comers 
Kiiniiiiiig dong the edges must iiise (loiii sigiiificint surf ice curvaUiie changes 
Ilowcvei IS S tends to infinity useful cornois begin to disappear A balance must 
bo btiuek between S ind R S must be suflhciciitly Urge to cancel out spuiious edge 
itsiionso, but bcjoiid th it, it simply intioduccs an ’ offset” which R can compensate 
foi Wc limit S to the \alue 0 4, and set R = 0 A value of E = 10000 works 
well Figuic 6 6 shows the lesults of comer point extraction for the CLAIRE and 
hAI I HMAN s( (iu( IK ( s i( spe c live ly 



(i) ( I AIUl Imige (b) Coiiiei De tee Lion in CLAIRE 
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(i) SALLSMAN Image (b) Comer Detection In SALESMAN 

1 it,uu G G Couui (JctLctioii in i lidiiic 


Aftci fiiuliiif, the coinei points of current and previous frames, our objec 
tivo IS to match the comer points in the two fiaines The procedure for corner 
tiackiiig iiid nnUliing is explained in section 4 3 is implerncnted In matching the 
corner points of piCMous frame with the current frame the winning candidate is one 
with the highest condition value c,nai . piovided Cmax exceeds a specific threshold 
MINJJOllR in Dill case This threshold is necessaiy since the ’’best" corner m the 
window need not be a valid match, e g the true future may have disappeared In 
our ( ase wc sc t MIN COBR^i) G T Ins woikecl well foi GI AIRE sequence, but for 
SALESMAN, bcc lusc of the outlicis the Ic ist squ uc me Ihod foi eilculatmg motion 
pai aiiH I c I s (lid not ( oil v( 1 g( So w( s( t il/ / /V C f .1 /f/i* I) 8 i n t his c isc Bii t tins is 
not a solution A robust method for rejecting outliers is needed Figure 6 7 show the 
cornel point ti iclung fiom pnvious fi uric to cuiicnt fi une loi a putieuhi frame of 

the AmHORstqwme 
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1 lom (Ik t oiiu i point ti itking mfoim ilion motion pr irru tc is iie extracted 
uul us(d to piodu t (lu next fi nne The Icist squaios method is used for estimating 
llu motion pu niK t( IS 


0 3 Coding 

Picdutiou (1101 winch is (lu dilkiciuc bdwtcii ciiiidit li mu ind picdieted fiaine 
IS encoded m mi cfneient wat suitable for low bit late applications In addition 
motion pii nncttis aie encoded using fixed length coding 

6 3 1 Piediction eiior coding 

In Pudietion cum coding stage we first divide the cnoi image into 8x8 blocks 
Subscciucntly we operate on a block by block basis In this method, not all error 
blocks aiG coded Ihc sigmhcaiit eiior blocks in which the total absolute error is 
gi cater than a tlncshlod (say Tl) are only transfoiin coded using the DOT By 
experimentation, we found Tl=800, as a reasonable value This reduces compu 
tdtioiial time and number of bits icquiied to encode, as a losult of increasing the 
eflicicncy of codei The numbei of blocks passed before encountering the significant 
eiioi block IS called Block-Run This is shown m figuie 6 8 After moving part the 
two ciioi blcKks the fust sigmfic uit erioi block is encountered and so is assigned 
a Block Run of two Smiiluly, cuh signihc iiit ciioi block is is&ociated with a 

Block Run as shown in figure 6 8 
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Sf r 


I Block wU Block B 24 


1 iguK 6 8 Euoi Blocks with Block Run 


Ihc followings fit^uie 6 9 shows the oiigiiial eiror image and modified eiior 


imag( with only sigiiifu iiit tiioi blocks 



I iguio 6 9 Oiigm \1 Erioi Image and Modified Error Image 


1 uh siKiiili' '"I >'>™ I-' <^“‘’><1 

Jiiw/mm follciudl l>y /ig/ 'P '!>' ^ ‘ 

a-i cxpl uacd in socUoiis 51 5 2 and 6 3 respectuely The folloa.ng figures explain 

this method di igi mi itic ilh whh lesults 
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6 3 2 IVIolioii paiaineters coding 


Motion p u unc tcis iic encoded using uniform quantization and fixed length coding 
W( UK illoc iting 8 bits foi c nfi motion pirainetc'i 

6 4 Estimated bit late 

Using the ibovc technique foi coding, we got an estimate of the bit rate for CLAIRE 
sequence is follows 

I' or each frame 
Aici vge nuinbti of legions = 50 
Avci igc niimbei of mmiiig icgions = 12 
Nuinbci of bits lequircd to code each moving region = 48 
Bits uquiud foi motion mloiiiutioii = 12 x 48 = 57C 
Avg niiinbci of oiior blocks = 15 

Avci \g( iiuiiibei ol bits iceiuiicd to code eiioi iiilouiutioii = 1100 
Flag bits foi iegioiis(movmg/static) = 50 
Hence, Nuinbci of bits/fiamc = 576+1100+50 = 1726 
Piauie rate = 7 5 fi uiics/sccoud 

lluicloic, ESTIMATED BIT RATE = 7 5 x 1726 = 13K (approximately) 


6 5 Discussion 

1 he results of simulation foi standaid sequences such as 1) CLAIRE 2)SALESMAN 
ind also foi llu sceiucncc tikcii m the hb titled as the "AUTHOR sequence” are 


shown in figures 6 11 6 40 On the left is the oiigiinl frame, and on right is the re 
coustuictcd ii viiK iftci motion compuisation pitdiction m the decoder Ihe PSNR 
obt lined is 38 34 foi dificient sequences indicated good quality of reconstructed 
nil igcs 


Mean scpiut eiioi (MSD) and signal to noise ratio (SNR) variations for 
Cl AIRL ind 6/l/L5'llMyVsequcnees aio shown in figures 6 41 6 40 The Variations 
in MSE and SNR with fiame number change accoidmg to the motion of objects in 
the couespoiidiiig fiames If the Piediction is poor, then the prediction error will 
be more and as each time the previous decoded fiame is segmented this error 
will aeeumulalo ovei the Iiamcs Consequently SNR decreases with the number of 
fiames 1 he SNR ilso degi ades when the motion between frames is large and include 
complex motion 

Change m SNR willi bit late foi standard CLAIRE and sales man sequences 
is shown m hgmes G 45 and G 4G 
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(a) Original (b) Reconstructed 

Figuio 6 11 Original and reconstructed CLAIRE Sequence Frame 1 


(a) Original 


(b) Reconstructed 


bgure 612 Original and reconstructed CLAIRE Sequence Frame 2 
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(i) Ongiiicil (b) Reconstructed 

Figure 6 15 Original and Reconstructed CLAIRE Sequence Frame 5 


(a) Original (b) Reconstructed 

Figuie 6 16 Original and Reconstructed CLAIRE Sequence Frame 6 
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(d) Original (b) Reconstructed 

Figiiie 6 19 Oiiginal and Reconstructed CLAIRE Sequence Frame 9 




(a) Ongmal 

Figure 6 20 Original and 


(b) Reconstructed 

Reconstructed CLAIRE Sequence Frame 10 
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(a) Onginal (b) Reconstructed 

Figure 6 21 Original and Reconstructed SALESMAN Sequence Frame 1 



(a) Oiiginal (b) Reconstructed 

Figure 6 22 Original and Reconstructed SALESMAN Sequence Frame 2 
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(a) Original (b) Reconstructed 

Figure 6 23 Original and Reconstructed SALESMAN Sequence Frame 3 



(a) Original (b) Reconstructed 

Figure 6 24 Original and Reconstructed SALESMAN Sequence Frame 4 
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Figure 6 25 Original and Reconstructed SALESMAN Sequence Frame 5 


(a) Original (b) Reconstructed 

Figure 6 26 Original and Reconstructed SALESMAN Sequence Rame 6 




! 


j 
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(a) Oiiginal (b) Reconstructed 

Figure 6 27 Original and Reconstructed Sequence Prame-7 


i 
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(a) Original (^) Reconstructed 

Figure 6 28 Original and Reconstructed SALESMAN Sequence Prame-8 
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(a) Original (b) Reconstructed 

Figure 6 29 Original and Reconstructed SALESMAN Sequence Fraine-9 



(a) Original (b) Reconstructed 

Figure 6 30 Original and Reconstructed SALESMAN Sequence Frame 10 
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(a) Original (b) Reconstructed 

Figure 6 31 Original and Reconstiucted A t/TFOii Sequence Frame-1 

] 



(a) Oiigmal (b) Reconstructed 

Figure 6 32 Original and Reconstructed A f/TFOR Sequence Praine-2 
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e Frame 3 


(a) Ongiiial 


(b) Reconstructed 


Figure 6 34 Original and Reconstiucted Sequence Frame-4 
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(a) Origin il (b) Reconstructed 

Figure 6 35 Original and Reconstructed A l7!rROii: Sequence Frame 6 


(a) Oiigmal (b) Reconstructed 

Figure 6 36 Original and Reconstructed A C^TiTOii Sequence Praine-6 
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( Original (b) Reconstructed 

Figure 6 37 Original and Reconstructed A t/TFOi? Sequence Frame 7 


(a) Original (b) Reconstructed 

Figure 6 38 Original and Reconstructed A Sequence Frame 8 
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(a) Oiigmal (b) Reconstructed 

Figure 6 40 Original and Reconstructed AUTHOR Sequence Piaine 10 





SgndksftaseRafcfm*) Mean scjaie error 



'2 





2 


F am© Number 


6 


8 


} 6 13 Mean sciu uc cnoi of the leconskucted frames of SALESMAN Seqxiev. 





agrilonaseratoOncS)) Signal to Noise Rat}o(in db) 



Fiftuic 0 45 Com ip uisiou ol SNR at Various Bit Rates for CLAIRE Sequence 



Figuu 6 46 Comapiision of SNR at Various Bit Rates for SALESMAN Sequence 





Chapter 7 


CONCLUSIONS AND SCOPE 
FOR FUTURE WORK 

7 1 Conclusions 

In this tlusis wo doscnbod tlio implementation of a low bit late video codec for a 
bit rate vaiymg between 10 to 20 Kbps It is a legion based teduuque employing 
morphological wateished algoiithm for segmentation Unlike other segmentation 
methods, wateished algorithm piovides accurate segmentation at low computational 
cost The ovei segmentation arising from applying a simple gradient image as input 
to the watershed algorithm is taken care of by using multiscale gradient followed 
by an algoiithm to eliminate small local minima In addition, the segmentation 
based method eliminates the pioblem of tiansmittmg laige amount of region shape 
information because this can be obtained by segmenting the reconstructed picture 
at both the ends 

The motion estimation algoiithm used m the design of the coder uses a novel 
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coiiKi dcttclioii ind ti ickint ippioadi lias nioLiou tsUiuation ipproadi is inudi 
more eflicicnt thin other suggested appioadies \\iudi use optimisation methods 
Cornel points are used to lepresent a legion and estimation of their motion, through 
tracking, is used to chai ictciize the motion of tlio icgioii This oIFers two major 
idv lilt ages lesser computational cost and lowei noise susceptibility 

The eiroi image obtained by subtiactiiig the motion compensated frame 
fiomllu tuiunthiuin is (udid using tin 1301 iln ciioi infoim ition of only those 
Significant eiioi blocks is sent for winch the error is greatei than a threshold The 
algonthrir devdoped w rs tested on first 10 frames of standard Clarre Salesman 
sequence as well as the Author sequence A data rate between 10 to 20 Kbps was 
obtained it i fi line i ite of 7 5 frames per second The SNR was of the order of 
31 38 (IB Ihe I o\\ bit i it( eodcr developed is lobusL md coinput itionally clhciciit 

7 2 Scope foi future work 


Ihe facgmentation piocess is coiitiolled by the pumieler li 1 ixing a value ol h 
depending on one sequence may not give very good segmentation for other sequences 
One may find an optimum value for li tluough experimentation Fixing up small 
values like 8 or 10 for h may be suitable for some sequences m which case we call the 
segmentation algoiitlim as uiisupei vised, but will not give optimum segmentation 
results for all scciuenee s This may lead to ovei segmentation As a result the moving 
regions will meiessc and ihe bit latc will go high So an automatic detection method 
for this constant h is needed in which case the segmentation algorithm will become 

fully unsnp Cl vised 

In motion cbtimiUon, comer tracking must have sufficient immunity to 
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oulhtis If lh( uuinbd ol oufhub \ie moic cspt tially when the motion is large 
between the fi lines least squaie method may not coneerge In such cases outliers 
cicate seiious pioblems vud motion estimation may fail Also efficient method is 
needed to fill the Uncoveied regions piodueed after motion compensation Extension 
of this appioach to ooloi sequences requires good segmention algorithm for color 
linages [6] 


1 
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Appendix- A 

Algorithm Fast Watersheds 


#defin 0 MASK -2 /* initial value of a threshold level */ 

#define WSHED 0 /* value of the pixels belonging to the watersheds*/ 
#defiiie INIT -1 

— INPUT imi decimal image 
— OUTPUT irao image of the labeled watersheds 

* INITIALIZATIONS 

Value INIT is assigned to each pixel of irao imo(p) = INIT 
current_label = 0 
current_dist integer variable 
- imd work image (of distances) , initalized to 0 

* Sort the pixels of imi, in the increasing order of 

their gray values 

Let h(min) and h(max) designate the lowest and highest 

values respectively 

* For h = h(rain) to h(max) { 

/* geodesic SKIZ of level h-1 inside level h */ 

For every pixel p such that imi(p) - h { 

/* These pixels are accessed directly through 

the sorted array +/ 
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irao(p) = MASK 

if there exists p’ in N(G)(p) such that imo(p ) 

> 0 or imo(p’) = WSHED 

{ 

irad(p) = 1 
f if o_add(p) 

} 

current_dist = 1 f ifo_add(f ictitious_pixel) 
repeat indefinitely { 
p = fifo„first() 
if p = f ictitious_pixel-{ 

if fifo_einpty() = true then BREAK 
else { f ifo_add(f ictitious„pixel) 
current.dist = current_dist + 1 
p » fifo.firstO , 

} 

} 

For every pixel p’ in N(G)(p) { 

if imd(p') < current _dist and (imo(p’) 

> 0 or imo(p’) = WSHED) { 

/* le e p’ belongs to an already 
labeled basin or to the Watersheds */ 

if imo(p') > 0 { 

if imo(p’) = MASK or imo(p) == WSHED then 

imo(p) = imo(pO 
else if imo(p) '= imo(p’) then 
imo(p) = WSHED 
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} 


else if iino(p) = MASK then imo(p) = WSHED 

} 

} 

} 

/+ checks if new minima have been discovered */ 

imd(p) =0 /* the distance associated with p is reset to 0 */ 

if (imd(p) = MASK { 

current_label = current_.label +1 
fifo_add(p) imo(p) = current_label 
while fifo_,empty() = false { 
p’ = fif o„f irstO , 

For every pixel p" in N(G)(p') { 

if imo(p") = MASK { fifo_add(p' ’) 
imo(p’') = current_label } 

} 

} 

} 

> 

} 
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Appendix-B 
VLC Tables for Coding 
the DCT Coefficients 


Table B 1 Huffman code table for 
luminance DC difference in JPEG 


( atof,oiy 

code length 

Codcwoid 

0 

2 

00 

1 

1 

010 

2 

3 

on 

3 

3 

100 

4 

3 

101 

5 

3 

no 

6 

4 

1110 

7 

5 

11110 

8 

6 

111110 

9 

7 

iinno 

10 

8 

11111110 

11 

9 

111111110 
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Table B 2 VLC Table for Luminance AC coefficients in JPEG 


Run 

Level 

Codeword 

Run 

Level 

Codeword 

0 

0 

1010 

3 

5 

iiniiiiiuoiuooo 

0 

1 

00 

3 

G 

1111111110010001 

0 

2 

01 

3 

7 

1111111110010010 

0 

3 

100 

3 

7 

1111111110010010 

0 

4 

1011 

3 

8 

1111111110010011 

0 

5 

11010 

3 

9 

1111111110010100 

0 

0 

1111000 

4 

1 

111011 

0 

7 

11111000 

4 

2 

lllllllOOO 

0 

8 

1111110110 

4 

3 

1111111110010110 

0 

9 

1111111110000010 

4 

4 

1111111110010111 

0 

10 

1111111110000011 

4 

5 

1111111110011000 

1 

1 

1100 

4 

G 

ninmiooiiooi 

1 

2 

non 

4 

7 

1111111110011010 

1 

3 

n 11001 

4 

8 

1111111110011011 

1 

4 

111110110 

4 

9 

1111111110011100 

1 

5 

innnoiio 

4 

10 

1111111110011101 

1 

6 

111111 1110000100 

5 

1 

1111010 

1 

7 

1111111110000101 

5 

2 

11111110111 

1 

8 

1111111110000110 

5 

3 

1111111110011110 

1 

9 

niiiniiooooiii 

5 

4 

1111111110011111 
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Run 

Level 

Codeword 

Run 

Level 

Codeword 

1 

10 

1111111110001000 

5 

5 

1111111110010000 

2 

1 

11100 

5 

6 

1111111110010001 

2 

2 

11111001 

5 

7 

1111111110010010 

2 

3 

1111110111 

5 

8 

lllllllllOOlOOll 

2 

4 

111111110100 

5 

9 

1111111110010100 

2 

5 

1111111110001001 

5 

10 

1111111110010101 

2 

6 

1111111110001010 

6 

1 

1111011 

2 

7 

1111111110001011 

6 

2 

111111110110 

2 

8 

1111111110001100 

6 

3 

1111111110100110 

2 

9 

1111111110001101 

6 

4 

1111111110100111 

2 

10 

1111111110001110 

6 

5 

1111111110101000 

3 

1 

111010 

6 

6 

1111111110101001 

3 

2 

111110111 

6 

7 

lllllllllOlOlOlO 

3 

3 

111111110101 

G 

8 

1111111110101011 

3 

4 

niiiiiiioooiiii 

G 

9 

1111111110101100 

6 

10 

1111111110101101 

10 

0 

llllllllilOOiOll 

7 

1 

11111010 

10 

7 

1111111111001100 

7 

2 

111111110111 

10 

8 

1111111111001101 

7 

3 

1111111110101110 

10 

9 

1111111111001110 

7 

4 

1111111110101111 

10 

10 

1111111111001111 

7 

5 

1111111110110000 

11 

1 

1111111001 

7 

0 

1111111110110001 

11 

2 

llllllllllOlOOOO 

7 

7 

1111111110110010 

11 

3 

llllllllllOlOOOl 

7 

8 

1111111110110011 

11 

4 

1111111111010010 

7 

9 

1111111110110100 

11 

5 

1111111111010011 

7 

10 

1111111110110101 

11 

G 

iiiiiimioioloo 

8 

1 

111111000 

11 

7 

iiiiiniiioioioi 

8 

2 

111111111000000 

11 

8 

1111111111010110 

8 

3 

1111111110110110 

11 

9 

1111111111010111 

8 

4 

1111111110110111 

11 

10 

miiiiiiioiiooo 
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Run 

Level 

Codeword 

Run 

Level 

Codeword 

8 

5 

1111111110111000 

12 

1 

1111111010 

8 

0 

1111111110111001 

12 

2 

1111111111011001 

8 

7 

1111111110111010 

12 

3 

1111111111011010 

8 

8 

1111111110111011 

12 

4 

1111111111011011 

8 

9 

1111111110111100 

12 

5 

1111111111011100 

8 

10 

1111111110111101 

12 

6 

1111111111011101 

9 

1 

111111001 

12 

7 

1111111111011110 

9 

2 

1111111110111110 

12 

8 

1111111111011111 

9 

3 

1111111110111111 

12 

9 

1111111111100000 

9 

4 

1111111111000000 

12 

10 

1111111111100001 

9 

5 

1111111111000001 

13 

1 

11111111000 

9 

6 

1111111111000010 

13 

2 

1111111111100010 

9 

7 

1111111111000011 

13 

3 

1111111111100011 

9 

8 

1111111111000100 

13 

4 

1111111111100100 

9 

9 

1111111111000101 

13 

5 

1111111111100101 

9 

10 

1111111111000110 

13 

C 

1111111111100110 

10 

1 

111111010 

13 

7 

1111111111100111 

10 

2 

1111111111000111 

13 

8 

1111111111101000 

10 

3 

1111111111001000 

13 

9 

1111111111101001 

10 

4 

1111111111001001 

13 

10 

1111111111101010 

10 

5 

1111111111001010 

14 

1 

1111111111101011 

14 

2 

1111111111101100 

15 

1 

llllllllllilOlOl 

14 

3 

1111111111101101 

15 

2 

1111111111110110 

14 

1 

1111111111101110 

15 

3 

1111111111110111 

14 

5 

1111111111101111 

15 

4 

1111111111111000 

14 

G 

1111111111110000 

15 

5 

1111111111111001 

14 

7 

1111111111110001 

15 

6 

1111111111111010 

14 

8 

1111111111110010 

15 

7 

1111111111111011 

14 

9 

1111111111110011 

15 

8 

1111111111111100 

14 

10 

ininiiiiiioioo 

15 

9 

1111111111111101 

15 

0 

11111111001 

15 

10 

1111111111111110 
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